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ABSTRACT 

We investigate the clustering of Hl-selected galaxies in the ALFALFA survey and com- 
pare results with those obtained for HIPASS. Measurements of the angular correlation 
function and the inferred SD-clustering are compared with results from direct spatial- 
correlation measurements. We are able to measure clustering on smaller angular scales 
and for galaxies with lower HI masses than was previously possible. We calculate the 
expected clustering of dark matter using the redshift distributions of HIPASS and 
ALFALFA and show that the ALFALFA sample is somewhat more anti-biased with 
respect to dark matter than the HIPASS sample. 
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1 INTRODUCTION 

Measurements of the clustering of galaxies allows one to in- 
vestigate the relationship between dark and luminous mat- 
ter. By comparing galaxies selected in different ways one 
gains understanding of how different galaxies trace the un- 
derlying dark matter and also of processes at work in galaxy 
evolution. This information is important when using galax- 
ies as probes of cosmological parameters. 

A number of new radio telescopes, such as the 
MeerKAlO , ASKAlQ and the SKA, are in the pipeline and 
they will detect huge numbers of galaxies using HI. A reli- 
able measure of the bias of Hl-selected galaxies and insight 
into the evolution of the bias is important for forecasting the 
capabilities of telescopes which will probe HI at intermedi- 
ate or high-redshifts. The clustering of H l -selected galax- 
ies ha s bee n studied by Meyer et al.l l|2007l 'l , iBasilakos et al.l 
l|2007h and iRvan-Webeil (|2006l ). The y used data from the 
HI Parkes AU Sky Survey (HIPASS. iMever et aLl liooj), a 
blind survey for HI of the southern sky which generated 
a catalogue of 4315 sources, the bulk of which have red- 
shifts below 2 ~ 0.02. They showed that Hl-selected galax- 
ies are less c lustered than galaxies selected in other ways. 
iMever et al.l (2007) investigated clustering of various sub- 
samples of HIPASS galaxies, showing that galaxies with high 
rotation velocities are more clustered than those with lower 
rotation velocities. There were indications that galaxies con- 
taining more HI are also more clustered but the differences 
were not as pronounced as in iBasilakos et al.. (,2007l ). The 
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latter work also measures the bias of HIPASS galaxies rela- 
tive to the expected dark matter distribution. 

In this paper we measure the clustering of Hl-selected 
galaxies detected with the Arecibo L-band Feed Array 
(ALFA) and compiled in the partially completed AL- 
FALFA survey (the Arecibo Legacy Fast ALFA survey, 
iGiovanelli et al.ll2005h . The results are compared with those 
obtained for HIPASS. Clustering measurements in HIPASS 
are limited to large angular scales where the beam-size of 
~ 15 arcmins does not cause confusion. The ALFALFA res- 
olution is more than four times better allowing us to probe 
clustering on smaller scales. The rms noise per ALFALFA 
beam is about six times smaller, providing a catalogue of 
sources which spans a wider range of redshifts and includes 
galaxies with lower HI masses. We are thus able to measure 
clustering of Hl-selected galaxies in regimes that have not 
yet been explored and to investigate trends seen in HIPASS, 
using an independent survey. 

The outline of the paper is as follows: In § [2] we give a 
short introduction to the HIPASS and ALFALFA surveys. 
The computation of the angular and spatial two-point corre- 
lation functions is described in §[3l The results are presented, 
discussed and compared with earlier work in §[4] Finally, §[5] 
concludes with a short summary. 



2 DATA 

HIPASS covers all the southern sky with S < + 2° and and 
can detect HI with velocities in the range 300 kms"'^ — 
12700 kms~\ The rms noise per beam is ~ 13 mjy 
ijMever et al.ll2004l L To exclude structure associated with 
the Milky- Way, like high- velocity clouds and low mass satel- 
lites, we only use sources with recessional velocities larger 
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Figure 1. Plot of the normalised redshift distribution of the 
HIPASS and ALFALFA surveys as well as the distribution of the 
ALFALFA sources in the three, spatially separated strips. 



than 600km The average mass of HI in HIPASS galaxies 
is 3.24 X IO^Mq. 

When complet ed, the ALFALFA Survey 
l|Giovanelli et al.l l2005t ) will cover 7000 deg^ of sky with 
high galactic latitude and to a depth of cz ^ 18000 kms~^. 
The rms noise of the survey is ~ 2.2 mjy and the beam-size 
is ~ 3.6 arcminutes. Currently the ALFALFA survey 
contains three strips covering a total area of ~ 400 deg'^. 
These are the two strips centred on the Virgo region 
and the anti- Virgo strip. They contain 1796 sources with 
cz > 600 kms~^. The first completed Virgo strip is defined 
by ll'^44'" < Q < 14''00™ and 12° < 5 < 16° and contains 
708 sources. The second Virgo strip contains 5 56 galaxies 
withi n ll''36'" < a < 13''52'" and 8° < 5 < 12° (|Kent et al.1 
|2008| ). The anti- Virgo strip contains 488 sources within 
22^0 0*" < a < 03'' 04™ and 26° < J < 28° (|Saintonge et al.1 
|2008| ). A circular region, with radius of 1° centred on 
M 87, has been remo ved from the survey a rea due to the 
interference of M 87 l|Giovanelh et al.l I2OO7I '). The average 
ALFALFA HI mass is 2.48 x IO'^Mq. The source density of 
the ALFALFA catalogue is approximately 20 times higher 
than that of the HIPASS survey. 

Figure [T] displays the normalised redshift distributions 
of the HIPASS and ALFALFA surveys as weU as the red- 
shift distributions in the three separate ALFALFA regions. 
A redshift of z = 0.02 corresponds to ~ 70 /i~^Mpc which 
is roughly the distance to the Coma supercluster. The high 
galaxy density near the Coma cluster and Virgo cluster are 
evident in the redshift distributions shown for the Virgo 
regions. In the anti- Virgo region the effect of the Perseus- 
Pisces supercluster can also be seen as a slight enhancement 
of galcixies at a redshift of about z ~ 0.025. Below, we will 
discuss the impact of these inhomogeneities on the determi- 
nation of the two-point correlation function. 



3 TWO-POINT CORRELATION FUNCTIONS 

Here we review some basic properties of the angular and pro- 
jected two-point correlation functions {to and H respectively) 
and indicate their relations to the three dimensional (3-D) 
real-space two-point correlation function, ^. Subsequently, 
we introduce the estimator used here and discuss the con- 



struction of the random samples. We do not emp l oy the 
weighted correlation functions used in iMever et al.l (|2007l ) 
as we are interested in comparing the results of the two sur- 
veys and in comparing our results with those predicted for 
dark matter within a ACDM model. The unweighted mea- 
surements suffice for this work and we are able to chec k our 
unweighted results against those of iMever et al.l l|2007t ). 



3.1 The Angular Two-Point Correlation Function 

The angular correlation function, cj, is a simple measure of 
the clustering of galaxies as a function of angular separation 
on the sky, 9, which does not require redshift information. 
It is calculated by counting galaxy pairs within a given an- 
gular separation bin and comparing this number to a corre- 
sponding figure derived from a random catalogue with the 
same area and shape. The angular correlation, ij{9), gives 
the excess probability, over random, of finding two galaxies 
separated by angle 9. 

If we assume a redshift-dependent power law de- 
scribes the 3-D real-space c orrelation fun ctio n, £, (r, z) = 
(r/ro )-'^(l + zy-^^+''^ (as in |Peeble"3ll980l and [Loan et al.1 
Il997f ) then the angular correlation function is related to the 
spatial correlation function by the Limber equation (|Rubinl 
l1954; Limber 1954): 
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(1) 

where d{z) is the comoving distance and N{z) is the redshift 
number density distribution of the sources (cf. Figure [l|. 
We use e = 0.8, consistent with the expected clustering be- 
haviour in linear theory, although the surveys are so shallow 
that the evolution of ^ could be ignored. The measured val- 
ues for the logarithmic slope ae = 1 — 7 and the correlation 
length 60 (aj(6o) — 1) can then be used to determine the 3-D 
parameters ro and 7. 

The errors for cj are calculated using jack-knife re- 
sampling l|Luptonll 19931 ). For this purpose the data are split 
up into A'^ RA-bins and the correlation function is recalcu- 
lated repeatedly each time leaving out a different bin. Thus a 
set of A'' values {uji, i = 1, N} for the correlation function 
are obtained and the jack-knife error of the mean, o-^i^^^^, 
is given by 



(2) 



The HIPASS sample has been divided into 24 RA bins 
while for the ALFALFA catalogue we use 12 bins such that 
each bin contains approximately the same area of the sky. 



3.2 The Projected Two-Point Correlation 
Function 

The projected correlation function, H((t), is determined by 
the number of pairs at given radial and projected separa- 
tions, TT and a, and a subsequent integration along the ra- 
dial direction. For that purpose the absolute radial distance 
between a pair of galaxies, vr = \{vi — Vj)/Ho\, and their an- 
gular separation, 6, are converted into a projected distance, 
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a = [{vi + Vj)/Ho] tan{e/2). Thus, 
H(a) _ 2 



10 



(3) 



where Dumit is the hmit where the integral converges. Here 
we set Diimit = 25 /i~^Mpc ~ 2500 kms"'^. The projected 
correlation functi on is related to the re al-space correlation 
function by (e.g.. lDavis fc Peebles! [l983 ): 



rdr 



(j.2 _ cr2)l/2 



(4) 



Assuming that the projected and real-space correlation func- 
tions follow power laws within the region of interest (r < 
10/i~^Mpc), the parameter for the real-space correlation 
function can be derived from the projected one by the fol- 
lowing expression: 



_ (Toy r(l/2)r((7-l)/2) 
r(7/2) 



(5) 



More specifically, we calculate ro and 7 by fitting a power 
law, H(CT)/cr = (a/cTo)""" , using the Levenberg-Marquardt 
nonlinear least-squares method. The parameters of the real- 
space correlation function are then given by 



ro = 00 



r(l/2)r((a„-l)/2) 



r(a^/2) 



(6) 
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Therefore, similar to the angular correlation function the 
projected correlation function can be used to determine the 
real-space clustering. We apply both methods to determine 
the real-space clustering strength based on the HIPASS and 
the ALFALFA surveys and compare the results. 

3.3 Estimator and random sampling 

Three different estimators are comm only used to determine 
the two-point correlation function (Davis fc H uchra 1982; 
lHainihonlll993:lLandy fc SzalavHTom In this work we use 
the iLandv fc Szalavl (liggd ) estimator as it reduces errors 
caused by edges and holes within a given catalogue. In par- 
ticular this is importa nt for the ALFALFA s urvey with the 
hole caused by M 87 jGiovanelh et al.llioOTi ') and the large 
edge effects due to the three strips. The estimator is of the 
form: 



C(r) 



DDjr) - 2DR{r) + RR{r) 
RR{r) 



(8) 



where r is the separation distance which has different mean- 
ings for the different correlation functions. For the angular 
correlation it denotes the separation angle, 6, for the pro- 
jected correlation function it is the projected distance, a, 
and for the real-space correlation it indicates the real-space 
distance r. DD{r) is the number of data-data pairs, DR(r) is 
the number of data-random pairs, and RR{r) is the number 
of random-random pairs all with separations r. 

The random catalogues were generated with uniform 
distributions on the sky and redshift distributions which re- 
semble the distribution of recessional velociti es in the survey 
smoo thed using kernel density estimation l|Wand fc Joned 
I1995I ). Throughout this work we use random samples that 
are equal in size compared to the corresponding data set. 
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Figure 2. Angular correlation functions for HIPASS and AL- 
FALFA. Error bars were calculated using jack-knife sampling. The 
solid red line and the small-dashed blue line show the correspond- 
ing power law fits for angles between 0.1 and 8° for ALFALFA and 
between 1 and 10° for the HIPASS data. The projected clustering 
of dark matter (in a ACDM model) with redshift distributions of 
HIPASS and ALFALFA are shown by the green dashed line and 
the magenta dotted line respectively 



We repeat the random catalogue generation 20 times and 
the calculation of random pairs in order to reduce the vari- 
ance from the random sampling. 



3.4 The angular correlation function of dark 
matter 

Based on Limber's equation, the redshift distributions of 
HIPASS and ALFALFA and the expression for the non - 
linear power spectrum discussed in lPeacock fc Doddsl (|l996l l 
we predict the angular correlation function of dark matter 
using the cosmological parameters given in iKomatsu et al.l 
(120091) . The bias parameter, b, at various angles is then de- 
termined by 



OJHI 



^darkmatter 



(9) 



4 RESULTS 

4.1 The full HIPASS and ALFALFA samples 

4-1.1 Angular correlation functions 

Figure [2] shows the angular correlation functions for HIPASS 
and ALFALFA data as well as the predicted correlation func- 
tions of cold dark matter weighted with the redshift distri- 
butions of the surveys. The straight lines are power law fits 
for pair separations in the range between 0.1 and 8° for AL- 
FALFA and between 1 and 10° for HIPASS. The effect of 
source confusion in HIPASS is evident at smaller angular 
scales. The corresponding parameters, 6^0 and ae, are given 
in Table [U 

The measured slopes in the two surveys agree reason- 
ably well. As expected, the value of 9o is lower for ALFALFA 
since it is deeper than HIPASS and the clustering in 3-D 
is washed out in the 2-D projection. ALFALFA also detects 
galaxies with lower HI masses which are potentially less clus- 
tered. 
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HIPASS 



ALFALFA 



do 0.603 ± 0.04° 0.044 ± 0.013° 
ae 0.56 ±0.02 0.59 ± 0.06 

Table 1. The angular clustering fitted parameters, 6o and • 



HIPASS 



ALFALFA 



o-Q 6.29 ± 0.36 fe-iMpc 5.34 ± 1.08 h-^Mpc 
1.62 ±0.04 1.68 ±0.13 

Table 2. The projected clustering fitted parameters, <jq and 
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4-1.2 Projected correlation functions 

Figure |3] shows the projected correlation functions for the 
HIPASS and ALFALFA surveys. The lines represent power 
law fits and the corresponding parameters are presented in 
Table [2] Once again the slopes, a^, agree well while the 
amplitude of clustering, oq, in ALFALFA is lower (although 
the uncertainties are fairly large). 

4-1.3 Inferred spatial correlations 

TableOshows the spatial correlation function parameters in- 
ferred from the angular and projected correlation functions 
obtained using Eq.[T]and Eq.|6l The subscripts, 9 and u, indi- 
cate which correlation function has been used to derive these 
parameters. The two values obtained for ro in HIPASS are 
within 2(T of each other and a gree wel l with the unweighted 
value of 2.7 obtained bv M ever et all lj2004 ). The two AL- 
FALFA values are consistent with each-other and indicate 
somewhat lower clustering than HIPASS. 

4-1-4 Bias estimation 

The predicted angular correlation function of dark matter is 
compared with our results in Figure O We have calculated 
the bias for the two surveys at each data point in the plot. 
For HIPASS, in the 1 — 10° range, we find bias values rang- 
ing from 0.54 to 0.70, with an average of 0.63. This is fairly 
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Figure 3. Projected correlation functions for HIPASS and AL- 
FALFA. Error bars are calculated using jack-knife sampling. Lines 
show the corresponding power law fits for separations between 
0.1 and 3.5 ft-^Mpc for ALFALFA and 0.2 and 8 fc-lMpc for the 
HIPASS data. 



Figure 4. Comparison of the projected correlation functions 
of the three strips of the current ALFALFA survey as well as 
the correlation function measured when ALFALFA data between 
0.02 < z < 0.03 is excluded (that is, when cluster galaxies in the 
Coma and Perseus- Pisces regions are excluded) The data points 
for the three strips are slightly offset to improve readability. Error 
bars are calculated using jack-knife sampling. 

consistent with the value of 0.68 obtained bv lBasilakos et al] 
(2007,). For ALFALFA, on the same angular scales, the val- 
ues range between 0.41 and 0.62, with an average of 0.52. 
Our results thus indicate that the ALFALFA sample is some- 
what more anti-biased than the HIPASS sample. This is con- 
sistent with the idea that ALFALFA includes galaxies with 
lower HI mass which are less clustered than the higher mass 
galaxies detected in HIPASS. We note, however, that the 
lower values are found at large scales where the narrowness 
of the strips may effect measurements more severely. 

4.2 Correlation functions of different ALFALFA 
subsamples 

4-2-1 Flux and HI mass subsets 

The ALFALFA data has been subdivided into two equivalent 
parts based on the flux of the sources. For these subsam- 
ples the angular and projected correlation functions were 
recalculated as des cribed in section [S] In agreement with 
iMever et al] l|2007l ) we flnd that the two correlation func- 
tions compare well with each other and with the correlation 
function of the whole data set indicating a negligible depen- 
dence of clustering on HI flux. 

We also split the samples evenly into high and low HI- 
mass subsamples. The clustering parameters obtained are 
shown in Ta ble [3l Our results for HIPASS are consistent 
with those of I Mever et al.l l|2007f ). indicating that the galax- 
ies with higher Hl-masses are more clustered. Interestingly, 
the same trend is not apparent in the ALFALFA survey but 
the uncertainties are fairly large. 

We did not attempt to separate the galaxies according 
to their rotation velocities as this requires additional data 
to estimate inclinations. 

4-2-2 Small Field effects 

The Virgo regions contain over-densities of galaxies that 
are associated with the Virgo and Coma clusters. There is 
the concern that the results will be biased by the presence 
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Table 3. The real-space clustering parameters, rg and 7 derived 
from the angular correlation function (indicated by subscript 6) 
and from the projected correlation function (indicated by sub- 
script cr. 

of such dominant large-scale structure within the relatively 
small survey fields. To investigate this, the correlation func- 
tions of the three regions were calculated separately and are 
shown in Figure |4] Measurements in the three regions agree 
to within their uncertainties, indicating that the presence 
of the big clusters within the Virgo regions do not effect 
the results significantly. We note however, that the Anti- 
Virgo region is near the Perseus-Pisces supercluster which 
causes a slight over-density in that field at a similar red- 
shift {z ~ 0.025). To be sure that over-densities in all three 
fields at this redshift were not biasing our results, we cut 
the galaxies with redshifts between ~ 0.02 and 0.03 out of 
the samples and recalculated the correlation functions. The 
results are also shown in Figure |3] and it is clear that the 
correlation functions with and without the redshift cuts are 
completely consistent within the uncertainties. 

As an additional check on the effect of the small fields on 
the measur ed clustering s t rength, we calculated the integral 
constraint (IPeeblesl llQSOl : iRatcliffe et al.|[l998l ') for a single 
field, and obtained a value of 0.145 which indicates an effect 
■within the uncertainty of the correlation function derived 
from the ALFALFA data. 



5 SUMMARY AND CONCLUSIONS 

We have measured the clustering of HLselected galaxies us- 
ing the ALFALFA survey data and compared this with re- 
sults for HIPASS. Our two methods for determining the 
real-space correlation function agree well and our results 
for HIPASS agree with those found bv lMever et al.l l|2004l ). 
The real-space clustering in ALFALFA appears to be even 
lower than in HIPASS, consistent with the idea that AL- 
FALFA probes galaxies with lower Hl-masses that are less 
clustered than their high-mass counterparts. Our measure- 
ments of high- and low-mass subsamples in ALFALFA do 
not provide evidence to support this idea but the uncertain- 
ties on the measurements are large. 

We have calculated the clustering of dark matter ex- 
pected within a ACDM model with redshift distributions 
of HIPASS and ALFALFA. We then calculated the bias of 
ALFALFA sources over the range 1 — 10° , finding a value of 



0.62 at 1° and an average value of 0.52 over the whole range. 
The significant anti-bias of galaxies with low Hl-mass is im- 
portant to consider when estimating the signal-to-noise of 
experiments planned for the SKA and its pathfinders. 
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